Training Quantized Nets: A Deeper Understanding
Authors
Hao Li, Soham De, Zheng Xu, Christoph Studer, Hanan Samet, Tom Goldstein
Abstract
Currently, deep neural networks are deployed on low-power portable devices by first training a full-precision model using powerful hardware, and then deriving a corresponding low-precision model for efficient inference on such systems. However, training models directly with coarsely quantized weights is a key step towards learning on embedded platforms that have limited computing resources, memory capacity, and power consumption. Numerous recent publications have studied methods for training quantized networks, but these studies have mostly been empirical. In this work, we investigate training methods for quantized neural networks from a theoretical viewpoint. We first explore accuracy guarantees for training methods under convexity assumptions. We then look at the behavior of these algorithms for non-convex problems, and show that training algorithms that exploit high-precision representations have an important greedy search phase that purely quantized training methods lack, which explains the difficulty of training using low-precision arithmetic.
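To make the contrast concrete, here is a minimal NumPy sketch of the two families the abstract distinguishes: stochastic rounding as a representative purely quantized method, and a BinaryConnect-style update that maintains a high-precision buffer. The function names, the grid spacing `delta`, and the step size `lr` are illustrative assumptions, not the paper's notation.

```python
import numpy as np

rng = np.random.default_rng(0)

def stochastic_round(w, delta):
    """Round each entry of w to an adjacent point of the grid {k * delta},
    rounding up with probability equal to the fractional remainder
    (so the rounding is unbiased: E[round(w)] = w)."""
    low = np.floor(w / delta) * delta
    prob_up = (w - low) / delta                 # in [0, 1)
    return low + delta * (rng.random(w.shape) < prob_up)

def sr_step(w_q, grad, lr, delta):
    """Purely quantized update: the only copy of the weights is quantized,
    so a step smaller than delta survives rounding only with probability
    proportional to its size."""
    return stochastic_round(w_q - lr * grad, delta)

def bc_step(w_r, grad, lr, delta):
    """BinaryConnect-style update: small gradient steps accumulate in the
    real-valued buffer w_r; the quantized weights used for the forward
    pass are a deterministic rounding of that buffer."""
    w_r = w_r - lr * grad
    w_q = np.round(w_r / delta) * delta
    return w_r, w_q
```

Because the real-valued buffer `w_r` accumulates arbitrarily small gradient steps, the BinaryConnect-style method can drift steadily toward a minimizer before committing to a new grid point, whereas a stochastic-rounding step smaller than `delta` is applied only occasionally; this is the sense in which purely quantized training lacks the greedy search phase described above.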
Similar resources
Towards a Deeper Understanding of Training Quantized Neural Networks
Training neural networks with coarsely quantized weights is a key step towards learning on embedded platforms that have limited computing resources, memory capacity, and power consumption. Numerous recent publications have studied methods for training quantized networks, but these studies have been purely experimental. In this work, we investigate the theory of training quantized neural network...
Model compression as constrained optimization, with application to neural nets. Part II: quantization
We consider the problem of deep neural net compression by quantization: given a large, reference net, we want to quantize its real-valued weights using a codebook with K entries so that the training loss of the quantized net is minimal. The codebook can be optimally learned jointly with the net, or fixed, as for binarization or ternarization approaches. Previous work has quantized the weights o...
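As a concrete illustration of the fixed-codebook setting described in this snippet, the sketch below fits a K-entry codebook to a weight vector with plain k-means (Lloyd iterations) and snaps each weight to its nearest entry. The name `kmeans_codebook` and the choices of `K` and `iters` are illustrative, and this baseline minimizes weight-space quantization error, whereas the work above argues for learning the codebook jointly against the training loss.

```python
import numpy as np

def kmeans_codebook(w, K=4, iters=25, seed=0):
    """Quantize a 1-D weight vector with a K-entry codebook fitted by
    plain k-means on the weights themselves."""
    rng = np.random.default_rng(seed)
    centers = rng.choice(w, size=K, replace=False)   # init entries from the weights
    for _ in range(iters):
        # Assign each weight to its nearest codebook entry ...
        assign = np.argmin(np.abs(w[:, None] - centers[None, :]), axis=1)
        # ... then move each entry to the mean of its assigned weights.
        for k in range(K):
            if np.any(assign == k):
                centers[k] = w[assign == k].mean()
    return centers, centers[assign]

w = np.random.default_rng(1).normal(size=4096)
codebook, w_quantized = kmeans_codebook(w, K=4)
```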
A Parameterized Complexity Analysis of Generalized CP-Nets
Generalized CP-nets (GCP-nets) allow a succinct representation of preferences over multi-attribute domains. As a consequence of their succinct representation, many GCP-net related tasks are computationally hard. Even finding the more preferable of two outcomes is PSPACE-complete. In this work, we employ the framework of parameterized complexity to achieve two goals: First, we want to gain a dee...
Square nets of tellurium: rare-earth dependent variation in the charge-density wave of RETe3 (RE = rare-earth element).
The distortions in the square tellurium nets long known to exist in the structures of the charge-density wave materials, RETe3, have been elucidated. The (Te2)1- nets contain polytelluride oligomers which propagate in a fashion incommensurate from the adjacent (RETe)+ sublattice. The new information sets the stage for a much deeper understanding of these systems.
FitNets: Hints for Thin Deep Nets
While depth tends to improve network performances, it also makes gradient-based training more difficult since deeper networks tend to be more non-linear. The recently proposed knowledge distillation approach is aimed at obtaining small and fast-to-execute models, and it has shown that a student network could imitate the soft output of a larger teacher network or ensemble of networks. In this pa...
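For reference, the soft-output imitation mentioned in this snippet reduces to a cross-entropy between temperature-softened softmax outputs. The sketch below is a minimal NumPy version; the temperature `T=4.0` is an illustrative choice, and the intermediate-layer hints that give FitNets its name are not shown.

```python
import numpy as np

def softened_probs(logits, T):
    """Softmax at temperature T; higher T spreads mass over more classes."""
    z = logits / T
    z = z - z.max(axis=1, keepdims=True)      # subtract the row max for stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def soft_target_loss(student_logits, teacher_logits, T=4.0):
    """Cross-entropy of the student's softened outputs against the
    teacher's softened outputs (the 'soft targets')."""
    p_teacher = softened_probs(teacher_logits, T)
    p_student = softened_probs(student_logits, T)
    return -np.mean(np.sum(p_teacher * np.log(p_student + 1e-12), axis=1))
```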
Publication date: 2017